INEQUALITIES FOR NUMERICAL INVARIANTS OF SETS OF MATRICES
Abstract. We prove three inequalities relating some invariants of sets
of matrices, such as the joint spectral radius. One of the inequalities,
whose proof uses geometric invariant theory, has the generalized spectral
radius theorem of Berger and Wang as an immediate corollary.



1. Introduction 

Let $M(d)$ be the space of $d \times d$ complex matrices. If $A \in M(d)$, we denote
by $\rho(A)$ the spectral radius of $A$, that is, the maximum absolute value of an
eigenvalue of $A$. Given a norm $\|\cdot\|$ in $\mathbb{C}^d$, we endow the space $M(d)$ with the
operator norm $\|A\| = \sup\{\|Av\|;\ \|v\| = 1\}$.

For every $A \in M(d)$ and every norm $\|\cdot\|$ in $\mathbb{C}^d$, we have $\rho(A) \le \|A\|$. On
the other hand, there is also a lower bound for $\rho(A)$ in terms of norms:

(1) $\|A^d\| \le C \rho(A) \|A\|^{d-1}$, where $C = 2^d - 1$.

In particular, if $\rho(A) \ll \|A\|$ then $\|A^d\| \ll \|A\|^d$.

Inequality (1) is a very simple consequence of the Cayley-Hamilton theorem.
Indeed, let $p(z) = z^d - \sigma_1 z^{d-1} + \cdots + (-1)^d \sigma_d$ be the characteristic
polynomial of $A$. Since $p(A) = 0$,

$$\|A^d\| \le \sum_{i=1}^{d} |\sigma_i| \, \|A\|^{d-i}.$$

Since the $\sigma_i$ are the elementary symmetric functions of the eigenvalues of $A$,

$$|\sigma_i| \le \binom{d}{i} \rho(A)^i \le \binom{d}{i} \rho(A) \|A\|^{i-1}.$$

Therefore (1) follows, because $\sum_{i=1}^{d} \binom{d}{i} = 2^d - 1$.
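
As a numerical illustration of inequality (1), the following minimal sketch checks it for a few $2 \times 2$ matrices. The choices made here are assumptions for the sake of the example and are not taken from the paper: the norm is the operator norm induced by the sup norm on $\mathbb{C}^2$ (maximum absolute row sum), and the helper names `matmul`, `op_norm_sup`, `spectral_radius_2x2` and `check_inequality_1` are hypothetical.

```python
import cmath

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def op_norm_sup(a):
    """Operator norm induced by the sup norm: maximum absolute row sum."""
    return max(sum(abs(x) for x in row) for row in a)

def spectral_radius_2x2(a):
    """Spectral radius of a 2x2 matrix via the roots of z^2 - tr(A) z + det(A)."""
    tr = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    disc = cmath.sqrt(tr * tr - 4 * det)
    return max(abs((tr + disc) / 2), abs((tr - disc) / 2))

def check_inequality_1(a):
    """Return both sides of ||A^d|| <= (2^d - 1) rho(A) ||A||^(d-1) for d = 2."""
    d = 2
    lhs = op_norm_sup(matmul(a, a))
    rhs = (2 ** d - 1) * spectral_radius_2x2(a) * op_norm_sup(a) ** (d - 1)
    return lhs, rhs

samples = [
    [[1, 100], [0, 1]],   # spectral radius 1 but large norm
    [[0, 5], [0, 0]],     # nilpotent: both sides vanish
    [[2, -3], [4, 1]],    # non-real eigenvalues
]
for a in samples:
    lhs, rhs = check_inequality_1(a)
    print(lhs, "<=", rhs)
```

The nilpotent sample shows the content of (1) most plainly: $\rho(A) = 0$ forces $A^d = 0$.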

The spectral radius theorem (for the finite-dimensional case) asserts that

(2) $\rho(A) = \lim_{n \to \infty} \|A^n\|^{1/n}$.

The formula above may be deduced from inequality (1), as we now show.



Since $\|A^{n+m}\| \le \|A^n\| \, \|A^m\|$, the limit in (2) exists (see [PS, problem 1.98]);
let us call it $r$. Clearly, $r \ge \rho(A)$. Applying (1) to $A^n$ in the place of $A$, using
that $\rho(A^n) = \rho(A)^n$ and taking the $1/dn$-power, we obtain

$$\|A^{dn}\|^{1/dn} \le C^{1/dn} \rho(A)^{1/d} \|A^n\|^{(d-1)/dn}.$$

Taking limits as $n \to \infty$, we get $r \le \rho(A)^{1/d} r^{(d-1)/d}$, that is, $r \le \rho(A)$,
proving (2). The author does not know whether this proof has ever appeared in the
literature.

Now, let $\Sigma$ be a non-empty bounded subset of $M(d)$. Define

$$\|\Sigma\| = \sup_{A \in \Sigma} \|A\|, \qquad \rho(\Sigma) = \sup_{A \in \Sigma} \rho(A).$$

If $n \in \mathbb{N}$, we denote by $\Sigma^n$ the set of the products $A_1 \cdots A_n$, with all $A_i \in \Sigma$.
Since $\|\Sigma^{n+m}\| \le \|\Sigma^n\| \, \|\Sigma^m\|$, the limit

$$\mathcal{R}(\Sigma) = \lim_{n \to \infty} \|\Sigma^n\|^{1/n}$$

exists and equals $\inf_n \|\Sigma^n\|^{1/n}$. Besides, it is independent of the chosen norm.
The quantity $\mathcal{R}(\Sigma)$ was introduced by Rota and Strang [RS] and is called the
joint spectral radius of the set $\Sigma$. For a nice geometrical interpretation of the
joint spectral radius, see [P2] (or [P3]).

Our first main result is a generalization of (1) to sets of matrices:

Theorem A. Given $d \ge 1$, there exists $C_1 > 1$ such that, for every bounded
set $\Sigma \subset M(d)$ and every norm $\|\cdot\|$ in $\mathbb{C}^d$,

$$\|\Sigma^d\| \le C_1 \mathcal{R}(\Sigma) \|\Sigma\|^{d-1}.$$

Our next result relates the joint spectral radius of $\Sigma$ with spectral radii of
products of matrices in $\Sigma$:

Theorem B. Given $d \ge 1$, there exist $C_2 > 1$ and $k \in \mathbb{N}$ such that for every
bounded set $\Sigma \subset M(d)$,

$$\mathcal{R}(\Sigma) \le C_2 \max_{1 \le j \le k} \rho(\Sigma^j)^{1/j}.$$

Using theorem B, we can extend the spectral radius theorem (2):

Corollary 1 (Berger-Wang generalized spectral radius theorem). If $\Sigma \subset M(d)$
is bounded then

$$\mathcal{R}(\Sigma) = \limsup_{n \to \infty} \rho(\Sigma^n)^{1/n}.$$

Proof. The inequality $\mathcal{R}(\Sigma) \ge \limsup \rho(\Sigma^n)^{1/n}$ is trivial. Applying theorem B
to $\Sigma^n$ and using that $\mathcal{R}(\Sigma^n) = \mathcal{R}(\Sigma)^n$, we obtain

$$\mathcal{R}(\Sigma) \le C_2^{1/n} \max_{1 \le j \le k} \rho(\Sigma^{nj})^{1/nj}.$$

Taking lim sup as $n \to \infty$, we get the result. □
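
The two quantities compared in Corollary 1 can be explored numerically. The sketch below is an illustration under assumptions not made in the paper: it enumerates all products of length $n$ for a sample pair of $2 \times 2$ matrices and prints $\rho(\Sigma^n)^{1/n}$ and $\|\Sigma^n\|^{1/n}$, using the maximum absolute row sum as operator norm. The sample set `sigma` and all function names are hypothetical choices.

```python
import cmath
from itertools import product

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def op_norm_sup(a):
    """Operator norm induced by the sup norm: maximum absolute row sum."""
    return max(sum(abs(x) for x in row) for row in a)

def spectral_radius_2x2(a):
    """Spectral radius of a 2x2 matrix via the roots of z^2 - tr(A) z + det(A)."""
    tr = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    disc = cmath.sqrt(tr * tr - 4 * det)
    return max(abs((tr + disc) / 2), abs((tr - disc) / 2))

def products_of_length(sigma, n):
    """Yield every product A_1 ... A_n with all factors in sigma."""
    for word in product(sigma, repeat=n):
        p = word[0]
        for m in word[1:]:
            p = matmul(p, m)
        yield p

def bracket(sigma, n):
    """Return (rho(Sigma^n)^(1/n), ||Sigma^n||^(1/n))."""
    prods = list(products_of_length(sigma, n))
    lower = max(spectral_radius_2x2(p) for p in prods) ** (1.0 / n)
    upper = max(op_norm_sup(p) for p in prods) ** (1.0 / n)
    return lower, upper

# Sample set chosen for illustration only.
sigma = [[[1, 1], [0, 1]], [[1, 0], [1, 1]]]
for n in range(1, 7):
    lower, upper = bracket(sigma, n)
    print(n, lower, upper)
```

Since $\rho(\Sigma^n)^{1/n} \le \mathcal{R}(\Sigma) \le \|\Sigma^m\|^{1/m}$ for all $m, n$, every printed lower value sits below every printed upper value. The enumeration has $(\#\Sigma)^n$ terms, so this brute-force approach is only practical for small $n$.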
The result above was conjectured by Daubechies and Lagarias [DL] and
proved by Berger and Wang [BW]. Other proofs were given in [E] and [SWP].

The proof of theorem A is elementary, while in the proof of theorem B we
shall use some geometric invariant theory. We also give another generalization
of (1), proposition 12 below, whose proof is elementary.

Remark. For all $\Sigma$ and $m, n \in \mathbb{N}$, we have $\rho(\Sigma^{mn})^{1/mn} \ge \rho(\Sigma^n)^{1/n}$ (because
$\Sigma^{nm} \supset (\Sigma^n)^m$). So in theorem B it is sufficient to take the maximum of
$\rho(\Sigma^j)^{1/j}$ over $j$ with $k/2 < j \le k$. Another consequence of the latter remark
is that

$$\limsup_{n \to \infty} \rho(\Sigma^n)^{1/n} = \sup_{n \in \mathbb{N}} \rho(\Sigma^n)^{1/n}.$$

2. Proof of theorem A

We first prove an inequality that is weaker than theorem A:

Lemma 2. Let $\|\cdot\|_e$ be the euclidean norm in $\mathbb{C}^d$. There exists $C_0 = C_0(d)$
such that

$$\|S \Sigma^d S^{-1}\|_e \le C_0 \|\Sigma\|_e \|S \Sigma S^{-1}\|_e^{d-1}$$

for every non-empty bounded set $\Sigma \subset M(d)$ and every $S \in GL(d)$.
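Proof. We shall also consider the norm in $M(d)$ defined by
$\|A\|_0 = \max |a_{ij}|$, where $A = (a_{ij})_{i,j=1,\ldots,d}$.
We first assume $S$ is a diagonal matrix $\mathrm{diag}(\lambda_1, \ldots, \lambda_d)$, with $\lambda_1, \ldots, \lambda_d > 0$.
Take $d$ matrices $A_1, \ldots, A_d \in \Sigma$, and write $A_\ell = (a^{(\ell)}_{ij})$. Then

$$\|S \Sigma S^{-1}\|_0 = \max_{i,j,\ell} |\lambda_i a^{(\ell)}_{ij} \lambda_j^{-1}|$$

and

$$\|S A_1 \cdots A_d S^{-1}\|_0 \le C_0 \max_{i_0, \ldots, i_d} |\lambda_{i_0} a^{(1)}_{i_0 i_1} \cdots a^{(d)}_{i_{d-1} i_d} \lambda_{i_d}^{-1}|,$$

where $C_0 = d^{d-1}$. Given integers $i_0, \ldots, i_d \in \{1, \ldots, d\}$, by the pigeonhole
principle there exists $1 \le k \le d$ such that $\lambda_{i_{k-1}} \le \lambda_{i_k}$. Therefore

$$|\lambda_{i_0} a^{(1)}_{i_0 i_1} \cdots a^{(d)}_{i_{d-1} i_d} \lambda_{i_d}^{-1}| = \prod_{1 \le \ell \le d} |\lambda_{i_{\ell-1}} a^{(\ell)}_{i_{\ell-1} i_\ell} \lambda_{i_\ell}^{-1}| \le \|\Sigma\|_0 \|S \Sigma S^{-1}\|_0^{d-1},$$

since the factor with $\ell = k$ is at most $|a^{(k)}_{i_{k-1} i_k}| \le \|\Sigma\|_0$ and each of the other
$d - 1$ factors is at most $\|S \Sigma S^{-1}\|_0$.
It follows that $\|S \Sigma^d S^{-1}\|_0 \le C_0 \|\Sigma\|_0 \|S \Sigma S^{-1}\|_0^{d-1}$. Up to changing $C_0$, the
same inequality holds for the euclidean norm $\|\cdot\|_e$.

Next consider the general case $S \in GL(d)$. By the singular value decomposition
theorem, there exist unitary matrices $U$, $V$ and a diagonal matrix
$D = \mathrm{diag}(\lambda_1, \ldots, \lambda_d)$, with $\lambda_1, \ldots, \lambda_d > 0$, such that $S = UDV$. Since $U$
and $V$ preserve the euclidean norm,

$$\|S \Sigma^d S^{-1}\|_e = \|D (V \Sigma V^{-1})^d D^{-1}\|_e \le C_0 \|V \Sigma V^{-1}\|_e \|D V \Sigma V^{-1} D^{-1}\|_e^{d-1} = C_0 \|\Sigma\|_e \|S \Sigma S^{-1}\|_e^{d-1}.$$

This proves the lemma. □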
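
The diagonal case of the proof of lemma 2 can be checked on a concrete example. The following sketch, with hypothetical helper names and sample data that are not from the paper, evaluates both sides of $\|S \Sigma^d S^{-1}\|_0 \le d^{d-1} \|\Sigma\|_0 \|S \Sigma S^{-1}\|_0^{d-1}$ in exact rational arithmetic for $d = 2$ and a badly conditioned diagonal $S$.

```python
from fractions import Fraction
from itertools import product

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def conj_diag(lams, a):
    """Entries of S A S^{-1} for S = diag(lams): lam_i * a_ij / lam_j."""
    n = len(a)
    return [[lams[i] * a[i][j] / lams[j] for j in range(n)] for i in range(n)]

def max_entry_norm(a):
    """The norm ||A||_0 = max |a_ij| used in the proof."""
    return max(abs(x) for row in a for x in row)

def lemma2_sides(sigma, lams):
    """Both sides of the max-entry-norm inequality for S = diag(lams)."""
    d = len(lams)
    lhs = Fraction(0)
    for word in product(sigma, repeat=d):   # all products of length d
        p = word[0]
        for m in word[1:]:
            p = matmul(p, m)
        lhs = max(lhs, max_entry_norm(conj_diag(lams, p)))
    norm_sigma = max(max_entry_norm(a) for a in sigma)
    norm_conj = max(max_entry_norm(conj_diag(lams, a)) for a in sigma)
    rhs = d ** (d - 1) * norm_sigma * norm_conj ** (d - 1)
    return lhs, rhs

# Illustrative data: two integer matrices and S = diag(10, 1/10).
sigma = [[[1, 2], [3, 4]], [[0, 1], [-1, 2]]]
lams = [Fraction(10), Fraction(1, 10)]
lhs, rhs = lemma2_sides(sigma, lams)
print(lhs, "<=", rhs)
```

The point of the lemma is visible here: only $d - 1$ of the $d$ factors on the right carry the possibly large conjugated norm.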

To make the constant in lemma 2 independent of the norm, we will use:

Lemma 3. There exists $C = C(d)$ such that, for every two norms $\|\cdot\|_1$, $\|\cdot\|_2$
in $\mathbb{C}^d$, there is $S \in GL(d)$ such that:

1. $C^{-1} \|v\|_1 \le \|Sv\|_2 \le \|v\|_1$, for all $v \in \mathbb{C}^d$;

2. $C^{-1} \|A\|_1 \le \|S A S^{-1}\|_2 \le C \|A\|_1$, for all $A \in M(d)$.

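Proof. The second part is an immediate consequence of the first one. To prove
the first part, it is enough to show that for every norm $\|\cdot\|$ in $\mathbb{C}^d$, there is $S \in GL(d)$
such that

(3) $C_d^{-1} \|v\| \le \|Sv\|_0 \le \|v\|$, $\forall v \in \mathbb{C}^d$,

where $\|\cdot\|_0$ is the sup-norm in $\mathbb{C}^d$ and $C_d = 2^d - 1$. The proof is by induction.
Let $\|\cdot\|$ be a norm in $\mathbb{C}^{d+1}$. Restrict it to the subspace $\mathbb{C}^d = \mathbb{C}^d \times \{0\} \subset \mathbb{C}^{d+1}$.
By the induction hypothesis, there is $S \in GL(d)$ such that (3) holds. If
$\pi_j : \mathbb{C}^{d+1} \to \mathbb{C}$ is the projection on the $j$-th coordinate, then $|\pi_j \circ S(v)| \le \|v\|$,
for all $v \in \mathbb{C}^d$ and $1 \le j \le d$. By the Hahn-Banach theorem, there are linear
functionals $\Lambda_j : \mathbb{C}^{d+1} \to \mathbb{C}$ such that $\Lambda_j | \mathbb{C}^d = \pi_j \circ S$ and $|\Lambda_j(w)| \le \|w\|$,
for all $w \in \mathbb{C}^{d+1}$ and $1 \le j \le d$. Let $\alpha = \|\pi_{d+1}\|$ and define a linear map
$\hat{S} : \mathbb{C}^{d+1} \to \mathbb{C}^{d+1}$ by

$$\pi_j \circ \hat{S} = \begin{cases} \Lambda_j & \text{if } 1 \le j \le d, \\ \alpha^{-1} \pi_{d+1} & \text{if } j = d+1. \end{cases}$$

Then $\hat{S} \in GL(d+1)$ and $\|\hat{S} w\|_0 \le \|w\|$, so $\hat{S}$ satisfies the second inequality
in (3). To prove the first one, let $\xi \in \mathbb{C}^{d+1}$ be such that $\pi_{d+1}(\xi) = \alpha$ and
$\|\xi\| = 1$. Write $\hat{S}(\xi) = \eta + e_{d+1}$ with $\eta \in \mathbb{C}^d$ and $e_{d+1} = (0, \ldots, 0, 1) \in \mathbb{C}^{d+1}$.
We have $\|\eta\|_0 \le \|\xi\| = 1$ and so $\|\hat{S}^{-1}(\eta)\| = \|S^{-1}(\eta)\| \le C_d \|\eta\|_0 \le C_d$.
Therefore

$$\|\hat{S}^{-1}(e_{d+1})\| \le \|\xi\| + \|\hat{S}^{-1}(\eta)\| \le 1 + C_d.$$

Now let $w \in \mathbb{C}^{d+1}$ be given. Write $w = v + t e_{d+1}$ with $v \in \mathbb{C}^d$ and $t \in \mathbb{C}$.
Then

$$\|\hat{S}^{-1}(w)\| \le \|S^{-1}(v)\| + |t| \, \|\hat{S}^{-1}(e_{d+1})\| \le C_d \|v\|_0 + (C_d + 1) |t| \le (2 C_d + 1) \max\{\|v\|_0, |t|\} = C_{d+1} \|w\|_0.$$

This proves that (3) holds with $d + 1$ and $\hat{S}$ in the place of $d$ and $S$. □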
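
The induction step above turns a constant $C_d$ into $2 C_d + 1$. As a small arithmetic check, the sketch below iterates this recurrence from the base value $C_1 = 1$ (the value of $2^d - 1$ at $d = 1$, taken here as an assumption about the base case) and compares it with the closed form; the function name is hypothetical.

```python
def constant_by_recurrence(d):
    """Iterate C_{d+1} = 2 * C_d + 1 starting from C_1 = 1."""
    c = 1
    for _ in range(d - 1):
        c = 2 * c + 1
    return c

for d in range(1, 8):
    # Each row shows d, the recurrence value, and the closed form 2^d - 1.
    print(d, constant_by_recurrence(d), 2 ** d - 1)
```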

The result below gives another characterization of the joint spectral radius.
For a proof, see [E] or [RS].

Proposition 4. For all bounded $\Sigma \subset M(d)$,

$$\mathcal{R}(\Sigma) = \inf \|\Sigma\|,$$

where the infimum is taken over all norms in $\mathbb{C}^d$.

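Proof of theorem A. Let $C_0$ and $C$ be as in lemmas 2 and 3. Let $\|\cdot\|_e$ be the
euclidean norm, and let $\|\cdot\|_1$, $\|\cdot\|_2$ be any two norms in $\mathbb{C}^d$. Let $S_1, S_2 \in GL(d)$
be given by lemma 3 such that

$$C^{-1} \|S_i A S_i^{-1}\|_e \le \|A\|_i \le C \|S_i A S_i^{-1}\|_e \quad \forall A \in M(d),\ i = 1, 2.$$

Take $\Sigma \subset M(d)$. Then (applying lemma 2 with $S = S_1 S_2^{-1}$ and $S_2 \Sigma S_2^{-1}$ in
the place of $\Sigma$)

$$\|\Sigma^d\|_1 \le C \|S_1 \Sigma^d S_1^{-1}\|_e \le C C_0 \|S_2 \Sigma S_2^{-1}\|_e \|S_1 \Sigma S_1^{-1}\|_e^{d-1} \le C^{d+1} C_0 \|\Sigma\|_2 \|\Sigma\|_1^{d-1}.$$

Taking the infimum over $\|\cdot\|_2$ in the right-hand side, we obtain, by proposition 4,

$$\|\Sigma^d\|_1 \le C_1 \mathcal{R}(\Sigma) \|\Sigma\|_1^{d-1}, \quad \text{where } C_1 = C^{d+1} C_0. \quad □$$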

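Let us reread theorem A in terms of another invariant. Given a non-empty
bounded $\Sigma \subset M(d)$, we define

(4) $\mathcal{S}(\Sigma) = \sup_{\|\cdot\|} \dfrac{\|\Sigma^d\|}{\|\Sigma\|^{d-1}}$ if $\Sigma \ne \{0\}$,

and $\mathcal{S}(\{0\}) = 0$, where the supremum is taken over all norms in $\mathbb{C}^d$. The functions
$\mathcal{R}(\cdot)$ and $\mathcal{S}(\cdot)$ are comparable:

Proposition 5. $\mathcal{R}(\Sigma) \le \mathcal{S}(\Sigma) \le C_1 \mathcal{R}(\Sigma)$.

Proof. The second inequality is theorem A. For any $\|\cdot\|$ we have $\|\Sigma^d\| \ge \mathcal{R}(\Sigma)^d$
and so, using proposition 4,

$$\mathcal{S}(\Sigma) \ge \sup_{\|\cdot\|} \frac{\mathcal{R}(\Sigma)^d}{\|\Sigma\|^{d-1}} = \mathcal{R}(\Sigma).$$

□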

3. Proof of theorem B

We shall need the following general result:

Proposition 6. Fix $d, \ell \ge 1$. Let $f : M(d)^\ell \to [0, \infty)$ be a locally bounded
function such that, for every $A_1, \ldots, A_\ell \in M(d)$,

• $f(S A_1 S^{-1}, \ldots, S A_\ell S^{-1}) = f(A_1, \ldots, A_\ell)$, $\forall S \in GL(d)$;
• $f(t A_1, \ldots, t A_\ell) = |t| f(A_1, \ldots, A_\ell)$, $\forall t \in \mathbb{C}$.

Then there exist numbers $k = k(d) \in \mathbb{N}$ and $C = C(d, \ell, f) > 0$ such that

(5) $f(A_1, \ldots, A_\ell) \le C \max_{1 \le j \le k} \rho(\Sigma^j)^{1/j}$, where $\Sigma = \{A_1, \ldots, A_\ell\}$.

Let us postpone the proof of this proposition and conclude the:
Proof of theorem B. Let $\mathcal{S}(\cdot)$ be as in (4). Define a function $f : M(d)^d \to [0, \infty)$
by $f(A_1, \ldots, A_d) = \mathcal{S}(\{A_1, \ldots, A_d\})$. By theorem A,
$f(A_1, \ldots, A_d) \le C_1 \|\{A_1, \ldots, A_d\}\|$ (for any norm); in particular, $f$ is locally
bounded. $f$ also satisfies the other hypotheses of proposition 6, thus there are
$k$ and $C_2$ such that

(6) $\mathcal{S}(\Sigma) \le C_2 \max_{1 \le j \le k} \rho(\Sigma^j)^{1/j}$,

for every $\Sigma \subset M(d)$ with at most $d$ elements. But

$$\mathcal{S}(\Sigma) = \sup\{\mathcal{S}(\Sigma');\ \Sigma' \subset \Sigma,\ \#\Sigma' \le d\},$$

hence (6) actually holds for every bounded $\Sigma$. Since $\mathcal{R}(\Sigma) \le \mathcal{S}(\Sigma)$
(proposition 5), theorem B follows. □

A few preliminaries in geometric invariant theory are necessary to prove
proposition 6. Some references are [N3] and [K].



3.1. Polynomial invariants. Let $V$ be a complex vector space, $G$ be a
group and $\iota : G \to GL(V)$ be a linear representation of $G$. We shall write
$gx = \iota(g)(x)$. The orbit of $x \in V$ is the set $O(x) = \{gx;\ g \in G\}$. Let $\mathbb{C}[V]$
be the ring of polynomial functions $\phi : V \to \mathbb{C}$. A polynomial $\phi \in \mathbb{C}[V]$ is
invariant if it is constant along each orbit, that is, $\phi(gx) = \phi(x)$. The ring of
invariants, denoted by $\mathbb{C}[V]^G$, is the set of all invariant polynomials.

For some groups $G$, called reductive groups, a celebrated theorem of Nagata
asserts that the ring $\mathbb{C}[V]^G$ is finitely generated. We shall not define a
reductive group; but some examples are $GL(d)$, $SL(d)$, $PGL(d)$. We assume
from now on that $G$ is reductive. In this case, the theory provides an algebraic
quotient of $V$ by $G$ with good properties:

Theorem 7. Let $\phi_1, \ldots, \phi_N$ be a set of generators of $\mathbb{C}[V]^G$. Let $\pi : V \to \mathbb{C}^N$
be the mapping $x \mapsto (\phi_1(x), \ldots, \phi_N(x)) \in \mathbb{C}^N$. Then:

0. $\pi$ is $G$-invariant (i.e., constant along orbits);

1. $Y = \pi(V)$ is closed;

2. $\pi(x_1) = \pi(x_2)$ if and only if the closures $\overline{O(x_1)}$ and $\overline{O(x_2)}$ have
non-empty intersection;

3. for every $y \in Y$, the fiber $\pi^{-1}(y)$ contains a unique closed orbit.

In the statement above, and in everything that follows, the spaces $V$ and
$\mathbb{C}^N$ are endowed with the ordinary (not Zariski) topologies. Notice that item 2 says
that $\pi$ separates every pair of orbits that can be separated by a $G$-invariant
continuous function.

Indication of proof. Let $\mathbb{C}(V)^G$ be the field of $G$-invariant rational functions.
It is easy to see that $\mathbb{C}(V)^G$ is the field of quotients of $\mathbb{C}[V]^G$. Let $\pi$ and $Y$
be as in the statement. Let $Z$ be the Zariski closure of $Y$, and consider $\pi$
as a function $V \to Z$. Then $\pi$ induces a homomorphism $\pi^* : \mathbb{C}(Z) \to \mathbb{C}(V)$
via $f \mapsto f \circ \pi$. One easily shows that $\pi^*$ is an isomorphism onto $\mathbb{C}(V)^G$, so
$\pi : V \to Z$ is an algebraic quotient in the sense of [K, §II.3.2]. Therefore
$\pi$ is surjective, that is, $Y = Z$, and item 1 follows. Items 2 and 3 are [N3,
corollary 3.5.2] and [K, Bemerkung 1, §II.3.2], respectively. In the references
above the Zariski topology is used instead. But this makes no difference here,
by [K, §AI.7]. □

Example. Let $G = GL(d)$ act on $V = M(d)$ by conjugation: $\iota(S)(A) = S A S^{-1}$
for $S \in G$ and $A \in V$. Given $A \in V$, let $\sigma_1(A), \ldots, \sigma_d(A)$ be the
coefficients of the characteristic polynomial of $A$. Then $\sigma_1, \ldots, \sigma_d \in \mathbb{C}[V]^G$.
Moreover, these polynomials generate the ring $\mathbb{C}[V]^G$. Let $\pi = (\sigma_1, \ldots, \sigma_d)$.
Then $\pi$ is onto $\mathbb{C}^d$. Every fiber $\pi^{-1}(y)$ consists of finitely many orbits, each
one corresponding to a different Jordan form. The closed orbits are those
of diagonalizable matrices. The fiber $\pi^{-1}(0)$ is the set of nilpotent matrices.
(See [K, §I.3].)
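
For $d = 2$ the example above can be made concrete: $\pi(A) = (\operatorname{tr} A, \det A)$. The following sketch, with hypothetical function names and sample matrices, checks in exact arithmetic that $\pi$ is constant along a conjugation orbit, and that a non-zero nilpotent matrix lies in the same fiber $\pi^{-1}(0)$ as the zero matrix although the two are not conjugate.

```python
from fractions import Fraction

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def inverse_2x2(s):
    """Exact inverse of an invertible 2x2 integer matrix."""
    det = s[0][0] * s[1][1] - s[0][1] * s[1][0]
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[Fraction(s[1][1], det), Fraction(-s[0][1], det)],
            [Fraction(-s[1][0], det), Fraction(s[0][0], det)]]

def invariants(a):
    """pi(A) = (sigma_1(A), sigma_2(A)) = (tr A, det A) for a 2x2 matrix."""
    return (a[0][0] + a[1][1], a[0][0] * a[1][1] - a[0][1] * a[1][0])

def conjugate(s, a):
    """Return S A S^{-1}."""
    return matmul(matmul(s, a), inverse_2x2(s))

a = [[2, -3], [4, 1]]
s = [[1, 2], [3, 5]]
print(invariants(a), invariants(conjugate(s, a)))

# Two different orbits inside the fiber over 0.
nilpotent = [[0, 1], [0, 0]]
zero = [[0, 0], [0, 0]]
print(invariants(nilpotent) == invariants(zero))
```

The orbit of the zero matrix is the closed one in this fiber, in line with item 3 of theorem 7.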

3.2. Topological considerations. Fix $\pi$ and $Y = \pi(V)$ as in theorem 7. We
endow the set $Y \subset \mathbb{C}^N$ with the induced topology. The following theorem was
proved independently by Luna [L] and Neeman [N1, corollary 1.6, remark 1.7]
(see also [N2]):

Theorem 8. The topology in $Y$ coincides with the quotient topology induced
by $\pi : V \to Y$ (i.e., $U \subset Y$ is open if and only if $\pi^{-1}(U)$ is open in $V$).



Corollary 9. The mapping $\pi : V \to Y$ is semiproper, that is, for every
compact set $L \subset Y$ there exists a compact set $K \subset V$ such that $\pi(K) \supset L$.

Proof. Suppose that for some compact $L \subset Y$ there is no compact set $K \subset V$
such that $\pi(K) \supset L$. Take compact sets $K_n \subset V$ such that $K_n \subset \operatorname{int} K_{n+1}$ and
$\bigcup_n K_n = V$. Then, for each $n$, there exists $y_n \in L$ such that $\pi^{-1}(y_n) \cap K_n = \emptyset$.
Up to replacing $(y_n)$ with a subsequence, we may assume that $y = \lim y_n$ exists
and $y_n \ne y$ for each $n$. Then the set $F = \{y_n;\ n \in \mathbb{N}\}$ is not closed in $Y$, but
$\pi^{-1}(F) = \bigcup_n \pi^{-1}(y_n)$ is closed in $V$, contradicting theorem 8. □

Let us derive a consequence of the above results: 

Lemma 10. If $f : V \to [0, \infty)$ is a $G$-invariant locally bounded function then
there exists a locally bounded $h : Y \to [0, \infty)$ such that $f \le h \circ \pi$.

Proof. Given $x \in V$, let $F_x = \pi^{-1}(\pi(x))$ be the fiber containing $x$. Set

$$\hat{f}(x) = \inf\{\sup f|U;\ U \text{ is a } G\text{-invariant open set containing } F_x\}.$$

(Here "$U$ is $G$-invariant" means $O(x) \subset U$ for all $x \in U$.) We claim that
$\hat{f}(x)$ is finite for all $x \in V$. Indeed, each fiber $F_x$ contains a unique closed
orbit $O(x_0)$, by theorem 7. Let $U_0$ be a bounded neighborhood of $x_0$; so
$\sup f|U_0$ is finite. Let $U = \bigcup_{z \in U_0} O(z)$; then $U$ is a $G$-invariant open set and
$\sup f|U = \sup f|U_0$. Moreover, $U$ contains $F_x$: for every $\xi \in F_x$, we have, by
theorem 7, $\overline{O(\xi)} \cap O(x_0) \ne \emptyset$, hence $O(\xi) \cap U \ne \emptyset$ and $\xi \in U$. This proves
that $\hat{f}(x) \le \sup f|U < \infty$.

The function $\hat{f} : V \to \mathbb{R}$ satisfies $\hat{f} \ge f$ and is also locally bounded. Since
$\hat{f}$ is constant on fibers, there exists $h : Y \to \mathbb{R}$ such that $\hat{f} = h \circ \pi$. The
function $h$ is locally bounded, because if $L \subset Y$ is a compact set then, by
corollary 9, there is some compact $K \subset V$ such that $\pi(K) \supset L$ and, in
particular, $\sup h|L \le \sup (h \circ \pi)|K = \sup \hat{f}|K < \infty$. □

3.3. $\ell$-tuples of matrices and end of the proof. From now on we set
$G = GL(d)$, $V = M(d)^\ell$ and

$$\iota(S)(A_1, \ldots, A_\ell) = (S A_1 S^{-1}, \ldots, S A_\ell S^{-1}).$$

In this case, a finite set of generators for $\mathbb{C}[V]^G$ is known:

Theorem 11 (Procesi [P1, theorem 3.4a]). The ring of invariants is generated
by the polynomials $\operatorname{tr}(A_{i_1} \cdots A_{i_j})$, with $1 \le j \le k$, where $k = 2^d - 1$.

We are now able to give the:

Proof of proposition 6. Let $k = 2^d - 1$, $N = \ell + \ell^2 + \cdots + \ell^k$, and let $\alpha_1, \ldots, \alpha_N$
be all the sequences $\alpha = (i_1, \ldots, i_j) \in \{1, \ldots, \ell\}^j$ of length $|\alpha| = j$, $1 \le j \le k$.
Let $\pi = (\phi_1, \ldots, \phi_N) : V \to \mathbb{C}^N$ be given by

$$\phi_i(A_1, \ldots, A_\ell) = \operatorname{tr}(A_{i_1} \cdots A_{i_j}), \quad \text{where } \alpha_i = (i_1, \ldots, i_j).$$

Let $Y = \pi(V)$. Define another function $\tau : \mathbb{C}^N \to \mathbb{R}$ by

$$\tau(z_1, \ldots, z_N) = \max\{|z_i|^{1/|\alpha_i|};\ 1 \le i \le N\}.$$

So if $x = (A_1, \ldots, A_\ell)$ then

$$\tau(\pi(x)) = \max\{|\operatorname{tr}(A_{i_1} \cdots A_{i_j})|^{1/j};\ 1 \le j \le k,\ 1 \le i_1, \ldots, i_j \le \ell\}.$$

Since $|\operatorname{tr}(A)| \le d \rho(A)$ for every $A \in M(d)$, we have

$$\tau(\pi(x)) \le d \max_{1 \le j \le k} \rho(\Sigma^j)^{1/j}, \quad \text{where } \Sigma = \{A_1, \ldots, A_\ell\}.$$

Notice that $\tau(\pi(tx)) = |t| \tau(\pi(x))$, for all $t \in \mathbb{C}$. Let $h : Y \to [0, \infty)$ be given by
lemma 10. Since $K = \tau^{-1}(1)$ is compact and $Y$ is closed, $C_0 = \sup h|(Y \cap K)$
is finite. Given $x \in V$, let $t = \tau(\pi(x))$. If $t \ne 0$ then

$$f(x) = t f(t^{-1} x) \le t h(\pi(t^{-1} x)) \le C_0 t = C_0 \tau(\pi(x)) \le d C_0 \max_{1 \le j \le k} \rho(\Sigma^j)^{1/j}.$$

Let $C = d C_0$; then (5) holds. If $t = 0$, that is, $\pi(x) = 0$, we argue differently.
By theorem 7, the orbit of $x$ accumulates at 0. It follows from the hypotheses
on $f$ that $f(0) = 0$ and $f$ is continuous at 0. Therefore $f(x) = 0$ and (5)
holds. This completes the proof of proposition 6 and so of theorem B. □

4. Another inequality and some questions 

We shall prove another inequality, proposition 12 below, which generalizes
(1) and is also an elementary consequence of the Cayley-Hamilton theorem.

We need some notation. For $s \ge 1$, let $S_s$ be the set of permutations of
$\{1, 2, \ldots, s\}$. Given $\sigma \in S_s$, decompose $\sigma$ into disjoint cycles, including the
ones of length 1:

$$\sigma = (i_1 \cdots i_k)(j_1 \cdots j_h) \cdots (t_1 \cdots t_e).$$

Then, given matrices $A_1, \ldots, A_s$, we set

$$\Phi_\sigma(A_1, \ldots, A_s) = \operatorname{tr}(A_{i_1} \cdots A_{i_k}) \operatorname{tr}(A_{j_1} \cdots A_{j_h}) \cdots \operatorname{tr}(A_{t_1} \cdots A_{t_e}).$$

Letting $\varepsilon(\sigma)$ be the sign of $\sigma$, we define

(7) $F(A_1, \ldots, A_s) = \sum_{\sigma \in S_s} \varepsilon(\sigma) \Phi_\sigma(A_1, \ldots, A_s)$.

Define also $P(A_1, \ldots, A_s) = \sum_{\sigma \in S_s} A_{\sigma(1)} \cdots A_{\sigma(s)}$. The trace identity from
[P1, Corollary 4.4] (which follows from the Cayley-Hamilton theorem by an
elementary process, see also [F, §4]) is:

(8) $\sum_{s=0}^{d} \sum (-1)^s F(A_{i_1}, \ldots, A_{i_s}) P(A_{j_1}, \ldots, A_{j_{d-s}}) = 0$,

where the second sum runs over all partitions of $\{1, \ldots, d\}$ into two disjoint
subsets $\{i_1 < \cdots < i_s\}$ and $\{j_1 < \cdots < j_{d-s}\}$; it is understood that $F(\emptyset) = 1$
and $P(\emptyset) = I$.
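
For $d = 2$, identity (8) reads $A_1 A_2 + A_2 A_1 - \operatorname{tr}(A_1) A_2 - \operatorname{tr}(A_2) A_1 + (\operatorname{tr} A_1 \operatorname{tr} A_2 - \operatorname{tr} A_1 A_2) I = 0$, the polarized form of the Cayley-Hamilton theorem. The sketch below evaluates the left-hand side term by term for integer matrices; the function names and the sample matrices are illustrative assumptions.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def trace_identity_d2(a1, a2):
    """Left-hand side of identity (8) for d = 2, as a 2x2 matrix."""
    p12 = matmul(a1, a2)
    p21 = matmul(a2, a1)
    # F(A1, A2): identity permutation (sign +) and the transposition (sign -).
    f12 = trace(a1) * trace(a2) - trace(p12)
    identity = [[1, 0], [0, 1]]
    result = [[0, 0], [0, 0]]
    for i in range(2):
        for j in range(2):
            s0 = p12[i][j] + p21[i][j]                             # s = 0: P(A1, A2)
            s1 = trace(a1) * a2[i][j] + trace(a2) * a1[i][j]       # s = 1
            s2 = f12 * identity[i][j]                              # s = 2: F(A1, A2) I
            result[i][j] = s0 - s1 + s2
    return result

a1 = [[2, -3], [4, 1]]
a2 = [[0, 7], [-5, 6]]
print(trace_identity_d2(a1, a2))
```

Taking $A_1 = A_2 = A$ recovers twice the usual Cayley-Hamilton relation $A^2 - \operatorname{tr}(A) A + \det(A) I = 0$.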

Proposition 12. Given $d \ge 1$, there exists $C > 1$ such that for every operator
norm $\|\cdot\|$ and every $d$ matrices $A_1, \ldots, A_d \in M(d)$, we have

$$\|P(A_1, \ldots, A_d)\| \le C \|\Sigma\|^{d-1} \max_{1 \le j \le d} \rho(\Sigma^j)^{1/j},$$

where $\Sigma = \{A_1, \ldots, A_d\}$.

Proof. We estimate the terms in (8), for $1 \le s \le d$. If $\sigma$ is a permutation of
$\{i_1 < \cdots < i_s\}$ with cycles of lengths $k_1, \ldots, k_h$ then

$$|\Phi_\sigma(A_{i_1}, \ldots, A_{i_s})| \le C_0 \, \rho(\Sigma^{k_1}) \cdots \rho(\Sigma^{k_h}),$$

where $C_0$ is a constant. The right hand side is $\le C_0 \rho(\Sigma^{k_i})^{1/k_i} \|\Sigma\|^{s-1}$, for any
$k_i$. Plugging this estimate in (7), we get, up to changing $C_0$,

$$|F(A_{i_1}, \ldots, A_{i_s})| \le C_0 \|\Sigma\|^{s-1} \max_{1 \le j \le d} \rho(\Sigma^j)^{1/j}.$$

Using the inequality above and the obvious bound $\|P(A_{j_1}, \ldots, A_{j_{d-s}})\| \le
(d - s)! \, \|\Sigma\|^{d-s}$, the result follows from (8). □



We do not know whether the methods of the proof of proposition 12 can
be improved to give an elementary proof of theorem B. Notice that if $k$ in
theorem B were equal to $d$ then proposition 12 would follow from theorems A
and B.

Question. What is the minimum $k$ such that theorem B holds? Can one take
$k = d$?

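The answer is yes when $d = 2$. The ring of invariants of two $2 \times 2$ matrices
$A_1$ and $A_2$ is generated by $\operatorname{tr} A_1$, $\det A_1$, $\operatorname{tr} A_2$, $\det A_2$, $\operatorname{tr} A_1 A_2$, see [F, §7].
Since $\det A$ can be expressed as a polynomial in $\operatorname{tr} A$ and $\operatorname{tr} A^2$, one can take
$k = 2$ in theorem 11, and so also in theorem B, when $d = 2$. Moreover, since
$\rho(\Sigma) \le \rho(\Sigma^2)^{1/2}$, theorem B assumes the form:

$$\mathcal{R}(\Sigma) \le C_2 \, \rho(\Sigma^2)^{1/2} \quad \text{for every bounded } \Sigma \subset M(2).$$

Using this inequality, it is easy to show that the sequence $\rho(\Sigma^{2n})^{1/2n}$ converges.
However, the sequence $\rho(\Sigma^n)^{1/n}$ itself does not necessarily converge.
We reproduce an example from [G]:

$$\Sigma = \left\{ \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix} \right\} \ \Rightarrow\ \rho(\Sigma^n) = \begin{cases} 0 & \text{if $n$ is odd,} \\ 1 & \text{if $n$ is even.} \end{cases}$$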
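
The example from [G] is small enough to verify by brute force. The sketch below (helper names are hypothetical) enumerates all products of length $n$ of the two matrices and computes the largest spectral radius among them.

```python
import cmath
from itertools import product

def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def spectral_radius_2x2(a):
    """Spectral radius of a 2x2 matrix via the roots of z^2 - tr(A) z + det(A)."""
    tr = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    disc = cmath.sqrt(tr * tr - 4 * det)
    return max(abs((tr + disc) / 2), abs((tr - disc) / 2))

def rho_of_power(sigma, n):
    """rho(Sigma^n): largest spectral radius over all products of length n."""
    best = 0.0
    for word in product(sigma, repeat=n):
        p = word[0]
        for m in word[1:]:
            p = matmul(p, m)
        best = max(best, spectral_radius_2x2(p))
    return best

A = [[0, 1], [0, 0]]
B = [[0, 0], [1, 0]]
for n in range(1, 9):
    print(n, rho_of_power([A, B], n))
```

The only non-zero products are the alternating ones: those of odd length reduce to $A$ or $B$, which are nilpotent, and those of even length reduce to $AB$ or $BA$, which are projections.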

Acknowledgement. I would like to thank J. V. Pereira for helpful discussions. 